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Abstract 

In this work, we focus on several completion problems for subclasses of chordal graphs: 
Minimum Fill-In, Interval Completion, Proper Interval Completion, Threshold 
Completion, and Trivially Perfect Completion. In these problems, the task is to add at 
most k edges to a given graph in order to obtain a chordal, interval, proper interval, threshold, or 
trivially perfect graph, respectively. We prove the following lower bounds for all these problems, 
as well as for the related Chain Completion problem: 

• Assuming the Exponential Time Hypothesis, none of these problems can be solved in time 

20(ni/Viog‘=n) QJ. 20(fc'/Viog‘=fc) . „0(i)^ integer c. 

• Assuming the non-existence of a subexponential-time approximation scheme for Min Bisection 
on d-regular graphs, for some constant d, none of these problems can be solved in time 

2o(n) Qj. 2°i^) -n^W, 

For all the aforementioned completion problems, apart from Proper Interval Completion, 

FPT algorithms with running time of the form • n^C) are known. Thus, the second 

result proves that a significant improvement of any of these algorithms would lead to a surprising 
breakthrough in the design of approximation algorithms for Min Bisection. 

To prove our results, we use a reduction methodology based on combining the classic 
approach of starting with a sparse instance of 3-Sat, prepared using the Sparsification Lemma, 
with the existence of almost linear-size Probabilistically Checkable Proofs (PCPs). Apart 
from our main results, we also obtain lower bounds excluding the existence of subexponential 
algorithms for the Optimum Linear Arrangement problem, as well as improved, yet still 
not tight, lower bounds for Feedback Arc Set in Tournaments. 


*St. Petersburg Department of Steklov Institute of Mathematics. E-mail: iabliznets@gmail.com. Partially 
supported by the Government of the Russian Federation (grant 14.Z50.31.0030), by the Grant of the President of the 
Russian Federation (MK-6550.2015.1) as well as by Warsaw Genter of Mathematics and Computer Science. 

Hnstitute of Informatics, University of Warsaw, Poland. E-mail: cygan@mimuw.edu.pl. Partially supported by 
the Polish National Science Gentre grant DEG-2012/05/D/ST6/03214. 

^Institute of Informatics, University of Warsaw, Poland. E-mail: kompaw01@gmail.com. 

^DIMAP and Department of Computer Science, University of Warwick, United Kingdom. E-mail: 
lukas.mach@gmail.com. Received funding by the European Research Council under the European Union’s Seventh 
Framework Programme (FP7/2007-2013)/ERC grant agreement no. 259385. 

'institute of Informatics, University of Warsaw, Poland. E-mail: mp248287@mimuw.edu.pl. Supported by the 
Polish National Science Centre grant DEC-2013/11/D/ST6/03073 and Foundation for Polish Science via the START 
stipend programme. During the work on these results, Michal Pilipczuk has been holding a post-doc position of 
Warsaw Center of Mathematics and Computer Science. 



1 Introduction 


In the Minimum Fill-In problem, also known as Chordal Completion, the input is an undirected 
graph G and an integer k, and the question is whether at most k edges can be added to G in order 
to turn it into a chordal graph, i.e., a graph without induced cycles of length at least 4 (also 
known as holes). The interest in this problem originates in the study of strategies for Gaussian 
elimination on sparse matrices, because the optimum number of additional entries of a matrix A 
that become non-zero during the elimination is tightly connected to the minimum fill-in of the 
graph Ga obtained by taking A to be its adjacency matrix. See [Enn] for more information on 
applications of Minimum Fill-In in the theory of sparse matrices. However, the problems of adding 
as few edges as possible to obtain a chordal graph, or a graph belonging to some natural subclass of 
chordal graphs, like interval, proper interval, trivially perfect, or threshold graphs, have numerous 
other applications ranging from database management, bioinformatics, artificial intelligence, to 
social networks. We refer to the introductory sections of [6l|71[T8l[T9l|2Tl|25l|32ll33] for a broader 
discussion and pointers to relevant literature. 

Minimum Fill-In is NP-hard, as shown by Yannakakis [Hj, however the reduction showing 
this is quite non-obvious; the complexity status of the problem was among the 12 open problems at 
the end of the hrst edition of the Garey and Johnson’s book [26]. The study of Minimum Fill-In 
from the point of view of the parameterized complexity, with k being the obvious parameter of 
interest, started with the pioneering work of Kaplan et al. [32]. They proposed a fixed-parameter 
(FPT) algorithm with running time 0{16^k^ k‘^mn) that is based on locating holes in the graph 
and branching on possible ways of adding edges to get rid of them. A similar strategy worked also 
for Proper Interval Completion [32], the problem of adding as few edges as possible to obtain 
a proper interval graph, but not for INTERVAL Completion. The fixed-parameter tractability of 
the latter has been resolved by Villanger et al. [43j only several years thereafter. 

A complete turning point came four years ago, when Fomin and Villanger [25] presented an 
algorithm for Minimum Fill-In with subexponential parameterized complexity, more precisely with 
running time +k‘^nm). This was an immense surprise to the parameterized complexity 

community, since subexponential parameterized algorithms, i.e., with running time 2°^^^ • 
were known essentially only in two restricted settings: in topologically constrained graphs via the 
technique of bidimensionality (see e.g. |16jl. and in tournaments, with an important example of 
Feedback Arc Set in Tournaments [3l[23l[33]. For most natural parameterized problems, the 
existence of such algorithms can be excluded under the Exponential Time Hypothesis, which (when 
combined with the Sparsification Lemma IM]) essentially states that there is no algorithm for 3-Sat 
with running time ; cf. [T4[[24] . 

The work of Fomin and Villanger [25] presented a conceptual breakthrough in the approach 
to completion problems for subclasses of chordal graphs. The main idea is not to focus on 
breaking individual obstacles such as holes by single edge additions, as was proposed in the 
previous works, but to build a structural decomposition of the completed graph by means of a 
dynamic programming algorithm that minimizes the number of edges missing in the decomposition. 
The crux is to show that this dynamic programming can be restricted to a space of states that 
has size • and can be enumerated efficiently. In case of Minimum Fill-In, 

the considered decomposition is the clique tree, but this generic approach can be in principle 
applied to every subclass of chordal graphs whose graphs have a well-defined global structure. And 
so, following [25], sub exponential parameterized algorithms have been designed for Threshold 
Completion [20], Trivially Perfect Completion [20], Proper Interval Completion [7], 
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and even Interval Completion [6]. Apart from Proper Interval Completion, for all these 
problems the algorithms have running time ; for the former we are currently stuck at 

exponent log k), but this is conjectured to be an artifact of the technique [7]. Let us remark 

that in each of these cases the idea of Fomin and Villanger only provides the basic outline of the 
strategy, while the actual implementation is always class-specific and requires involved technical 
insight. 

Drange et al. [laiso] complemented their results with a number of lower bounds suggesting that 
even a slight deviation from the setting of adding edges to a subclass of chordal graphs leads to the 
non-existence of a subexponential parameterized algorithm under the assumption of ETH. Thus, the 
aforementioned problems for which such algorithms exist are in fact intriguing “singularity points” 
on the complexity landscape. It is interesting that these singularity points actually correspond to 
problems that have important practical applications. Hence, the study of this phenomenon is an 
important direction that naturally belongs to the optimality programme (cf. |35ll38] i: a trend in 
parameterized complexity that focuses on systematic study of parameterized problems by providing 
possibly tight upper and lower bounds on their complexity. 

From this point of view, the first natural question is exactly how “deep” are the aforementioned 
singularity points. More precisely, is the running time of the form optimal, say 

under ETH? For subexponential algorithms derived using bidimensionality, the non-existence of 
. n^^^^^-time algorithms under ETH usually follows already from the known NP-hardness 
reductions [MllM!. However, for the considered completion problems this is not the case: as Fomin 
and Villanger already observed in [25], the classic NP-hardness reduction for Minimum Fill-In 
gives only a lower bound. A similar situation holds for Feedback Arc Set in 

Tournaments, for which the fastest known algorithms work in time ■ mm- 

For this reason, the question of establishing tight upper and lower bounds for Minimum Fill-In 
was already asked explicitly by Fomin and Villanger [25|, repeated by Marx in his survey on the 
optimality programme [38|, and then reiterated for respective subclasses in all the works [61171 fl^. 
The goal of this paper is to remedy this situation by providing complexity foundations for proving 
that the square root in the exponent of the running time is hard to improve. 

Our results. First, we investigate how strong lower bounds for completion problems can be 
obtained when we assume only ETH. 

Theorem 1.1. Unless ETH fails, there is an integer c > 1 such that there are no 
and consequently no algorithms for the following problems: Minimum Fill-In, 

Interval Completion, Proper Interval Completion, Trivially Perfect Completion, 
Threshold Completion, Chain Completion. 

Here, a graph G is a chain graph if it is a bipartite graph with a fixed bipartition A tt) H, where 
the vertices of A can be ordered by a linear ordering ^ such that N{u) C N[v) whenever u ^ v, 
for all u,v & A. In the Chain Completion problem we are given a bipartite graph with a fixed 
bipartition A tt) H, and we want to add at most k edges between A and B to obtain a chain graph. 
Chain Completion also admits an algorithm with running time Q{ 2 ‘^{Pk\ogk) _|_ [25] . 

Unfortunately, Theorem 11.11 does not provide a tight resnlt for completion problems we are 
interested in: we still have a gap between k^P and k^P. We in fact prove a higher lower bound, 
but we need to ground it on a stronger complexity assumption. More precisely, we consider the 
approximability of the Min Bisection problem on d-regular graphs: given a d-regular graph G 
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with an even number of vertices, find a partition of V(G) into equal-sized parts that minimizes the 
number of edges between the parts. For 0 < a < /3 < 1 , problem Gap Min BiSECTiON((i)[Q^^] is 
defined as the gap problem of deciding whether the optimum solution is at most am, or at least 
fdm. 

Hypothesis 1.2. There exist 0 < a < (3 < 1, and an integer d > such that there is no 

-time algorithm for Gap Min BiSECTiON(fi)[Q,^^]. 

Theorem 1.3. Unless Hvvothesis \1.2\ fails, there is no -time algorithm for Chain Gompletion, 

and no -time algorithms for Minimum Fill-In, Interval Gompletion, Proper Interval 
Gompletion, Trivially Perfect Gompletion, and Threshold Gompletion. Consequently, 
none of these problems can he solved in time 2°^^^ ■ 

Thus, Theorem 11.31 asserts that the existence of a substantially faster algorithm for any of 
the considered completion problems would lead to a breakthrough in the design of approximation 
algorithms for Min Bisection. We do not intend to take a stance on whether Hypothesis 11.21 
is true or false. However, the existence of an algorithm refuted by Theorem 11.21 is far from 
the current knowledge: the best polynomial-time approximation algorithms for Min Bisection 
achieve approximation factor O (log OPT) [10], whereas to refute Hypothesis 11.21 one would need 
to obtain approximation factor arbitrarily close to one. This could be, however, possible due 
to the assumption of d-regularity, for a constant d, as well as the access to subexponential-time 
computations. We are not aware of any approximation algorithms for Min Bisection that would 
significantly use any of these assumptions. In fact, Berman and Karpinski |5| have shown that 
approximating Min Bisection in graphs of maximum degree 3 is as hard as in general graphs 
from the point of view of polynomial time approximation. 

Finally, using the methodology of Theorem 11.11 we can give an improved lower bound for 
Feedback Arc Set in Tournaments. For this problem, the first subexponential FPT algorithm 
with running time was given by Alon et al. [3], which was later improved to 

20 {Vk) . ^O(i) independently by Feige [23] and by Karpinski and Schudy [33] . 

Theorem 1.4. 0 Unless ETH fails, there is an integer c > 1 such that there is no and 

consequently no 2^1^^'^'^/• n^U)-time algorithm for Feedback Arc Set in Tournaments. 

A direct inspection of the classic NP-hardness proofs mm gives only a , and consequently 

also a ■ lower bound, but Theorem fLil is still not tight: we have a gap between 

and similarly as in Theorem 11.11 Unfortunately, despite efforts we are unable to close this 
gap using Hypothesis 11.21 or a similar assumption. 

Our techniques. The main idea of this work is to carefully combine the standard approach 
of starting with a sparse instance of 3- Sat, prepared using the Sparsification Lemma [3l], with 
the existence of almost linear PCPs. More precisely, an application of the Sparsification Lemma 
reduces solving an instance ip of 3- Sat to solving a number of instances ■ ■ ■ ,4>i of 3- Sat 

on the same variable sets, but in each (fi we have that the number of clauses m is linear in the 
number of variables n. Then, to each (pi we apply the result of Dinur m on the existence of 
almost linear-size PCPs. This transforms each pi into an instance p[ of 3- Sat where the number 

^As the reduction in the proof of Theorem [L4] is randomized, here we assume the randomized version of the ETH. 
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of variables and clauses is O(nlog'^n), but the instance is either satisfiable, or at most a p-fraction 
of clauses can be satisfied, for some p <1. 

Having such an instance at hand, we can proceed with the chain of reductions proposed by Garey 
et al. m and Yannakakis [H]. Namely, we first use simple manipulations to obtain an equivalent 
instance of Max Cut with a gap, and then reduce it to the Optimum Linear Arrangement 
(OLA) problem: Given a graph G on n vertices, find a linear ordering vr: V{G) {1,2,... ,n} 
that minimizes the cost defined as YluveE{G) K(^) “ value |7r(u) — 7r(i;)| is also called the 

cost of uv. This is precisely the moment where we exploit that we are working with an instance 
of Max Cut with a gap. Namely, the construction introduces a huge clique to the complement of 
the Max Cut instance; this clique is supposed to separate in the ordering the sides of an optimum 
max-cut solution. In Garey et al. m, its size must be large enough so that the cost of any edge 
jumping over the clique dwarfs the “noise” contribution that is given by internal ordering of parts 
on the left and on the right. Hence, the clique is chosen to be of size 0(n^), which explodes the 
size of the instance. However, we observe that by starting with an instance with a gap, we can 
accommodate the noise in the gap, and therefore we only need the clique to be of linear size. Thus, 
we obtain the following theorem that can be of independent interest. 

Theorem 1.5. Unless ETH fails, for some c > 1 there is no algorithm solving Optimum Linear 
Arrangement in time . 

Note that OLA can be solved by a simple Held-Karp dynamic programming on subsets in time 
2”' • so Theorem 11.51 gives an almost tight result. 

To obtain Theorem 11.11 one jnst applies slightly modified elegant reduction from (the decision 
variant of) OLA to Minimum Fill-In, proposed by Yannakakis [H]. The reduction firstly reduce 
OLA to Chain Completion. Because of this fact later we get a very specific Minimum Fill-In 
output instance. And this immediately gives us a reduction to Proper Interval Completion and 
Interval Completion problems. With slight changes similar result is obtained for Threshold 
Completion and Trivially Perfegt Completion. Theorem 11.41 is derived by applying the 
same methodology to the randomized hardness rednction for Feedbaok Arc Set in Tournaments 
of Ailon et al. [T]. 

The moment when we lose tightness in our chain of reductions is precisely the last reduction 
from OLA to Minimum Fill-In. This is because in the construction we introdnce a new vertex 
for every edge of the original graph, which blows up the vertex set quadratically. Therefore, the 
reduction would give a tight lower bound if we started with a sparse instance of Optimum Linear 
Arrangement. This is precisely our idea behind the proof of Theorem 11.31 

Namely, we observe that by starting from the hardness of Gap Min Bisection (d)[^ y 3 ], instead 
of Max Cut, we can give an alternative hardness reduction to Optimum Linear Arrangement 
in graphs of bounded degree by a constant d (OLA<((i) for short). This reduction is much more 
intricate. Essentially, we replace the usage of a huge clique (which would blow up the degree) by a 
careful construction using several layers of expander graphs that mimics the same behaviour. This 
proves the following resnlt. 

Theorem 1.6. Unless Hypothesis fails, there exists an integer d G N such that there is no 
2°i'G-time algorithm solving OLA<(d) in multigraphs. 

From this. Theorem 11.31 follows similarly as Theorem 11.11 followed from Theorem 11.51 
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Related work. The combination of ETH, Sparsification Lemma, and PCP tools was already 
used in the recent line of work on developing the theory of hardness of FPT approximation; see 
e.g. [8l llll[28ll34j . There, the main goal is to provide lower bounds on how the access to FPT-time 
computations can help in the design of approximation algorithms for fundamental inapproximable 
problems, like Clique or Set Cover. 

The existence of almost linear PCPs combined with ETH was also used by Marx m to 
give almost tight lower bounds for the running times of polynomial time approximation schemes 
(PTASes) for several fundamental problems on planar graphs and in the Euclidean plane. The 
basic principle behind the approach of Marx m is very similar to ours. Namely, it is observed 
that using ETH and PCP tools one can prepare a sparse instance of the maximization variant of 
3-Sat where it is hard to distinguish between a fully satisfiable instance and an instance where 
only a (1 — e)-fraction of the clauses can be satisfied in time for some constant c (see 

also Theorem [Q in this work). This observation is used as a base for further reductions refuting 
the existence of certain approximation algorithms for geometric and planar problems. 

Thus, this work provides another example where the said combination appears to be useful. 
This time we use it to prove improved lower bounds on the complexity of FPT algorithms solving 
certain graph modification problems exactly. The new idea in this work is that the gap property of 
the considered instances can be used not only to exclude the existence of approximation algorithms, 
but also to limit the instance size explosion in a chain of NP-hardness reductions by using more 
thrifty constructions. 

Hypothesis [12] can possibly have links with the hypothesis put forward by Feige [22]. Essentially, 
Feige conjectures the hardness of distinguishing a “typical” 3-Sat instance from a satisfiable one in 
polynomial time. From this, he derives as a corollary a variant of Hypothesis 11.21 but without the 
assumption of d-regularity, for a constant d, and with subexponential time replaced by polynomial. 
It is conceivable that Hypothesis 11.21 can be also implied by some stronger variant of Feige’s 
conjecture, but we refrain from formalizing this link due to many technical problems that arise 
when attempting to do this. 

Outline of the paper. In Section (2] we give all the needed definitions and list the tools used 
in our reductions, such as the Exponential Time Hypothesis and the PCP theorem. Next, in 
Section (3] we prove Theorem 11.51 The main technical contribution of the paper, that is the proof 
of Theorem 11.61 is contained in Section 01 Theorem 11.61 is the main ingredient in the proof of 
Theorem 11.31 that is presented in Section [S] Section [6| is devoted to the proof of Theorem 11.41 

2 Preliminaries 

2.1 Parameterized complexity 

A parameterized problem Q is a subset of S* x N, for a fixed finite alphabet S. An instance 
of the problem Q is an element {x, /c) G S* x N with the integer k called the parameter. A 
parameterized problem is fixed parameter tractable if there exists an algorithm deciding whether 
{x,k) € Q, and working in time f{k) ■ poly(|x|) for every instance {x,k), where / is a computable 
function. For denoting the time of an algorithm we will use the O* notation that suppresses factors 
polynomial in the size of the input, e.g. 0*{f{k)). 

We denote a polynomial deterministic linear reduction from a problem X to T by X Y, i.e., 
X Y means that there is a polynomial time deterministic algorithm, which given an instance 
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I of the problem X produces an instance I' of the problem Y of size 0(|/|), such that / is a 
yes-instance if and only if I' is a yes-instance. 


2.2 Graph notations 

A standard graph theoretical notation is used throughout the paper. A graph G is a tuple 
{V{G), E{G)), where V{G) is the set of vertices and E{G) C is the set of edges. When 

it is clear from the context what graph we refer to, we will denote by V and E the set of vertices 
and edges of G. A graph H is a subgraph of G if V{H) C V{G) and E{H) C E{G). A graph H is 
an induced subgraph of G if V{H) C V{G) and E{H) = E{G) n An induced subgraph of 

G with vertex set X is denoted by G[Ai]. A complement of G is the graph with the vertices V and 
the edge (^) — E and we denote it by G. For X C 1/, we denote by the set of edges with 

exactly one endpoint in X. We define a cut, as the set of edges Eg{A, B), for a partition {A,B) of 
V. We denote the size of the cut by \Eg{A, B)\. 

We often use n, and m to denote the size of V and E, respectively. For a vertex v, deg(^(u) 
denotes the degree (the number of incident edges) of the vertex v. We say that a graph is 
d-regular if the degree of each vertex is equal to d. We denote by Aq the maximum degree of 
G. The set Ng{v) = {tc : {v,w) G E{G)} is the neighbourhood of v. We extend this notation 
to subsets of vertices X, i.e. Ng{X) = \ Ai. We will also omit subscripts, i.e. 

deg{v),A,N{X),6{X),E{U,V), when it is clear from the context, which graph we refer to. 

If X,Y CV are disjoint, then Eg{X, Y) is the set of edges between X and Y. We use G[X,Y] 
to denote the induced bipartite subgraph of G with parts X and Y. That is, G[X,Y] is the graph 
with vertex set X U T that contains precisely the edges Eg{X,Y). 

For X C 1/, we denote by (5g(X) the set of edges with exactly one endpoint in X. We define 
a cut, as the set of edges Eg{A,B), for a partition {A, B) of V. We denote the size of the cut by 
\Eg{A,B)\. 


2.3 Expanders 

The Cheeger number h{G) of a graph G is defined as 


h{G) := min 


Mil 

|X| 


: X C V{G),\X\ < 


Mil 

2 


We say that a graph G is a (d, e)-expander if it is d-regular and has Cheeger number of at least e. 
When discussing expanders, it is convenient to allow parallel edges and self-loops, which naturally 
appear in most of the expanders constructions. It is important to note that a self-loop contributes 
1 to the degree of a vertex. 

There are many efficient constructions of expanders available in the literature. The following 
theorem produces graphs with provably optimal Cheeger numbers: 

Theorem 2.1. Iddlldgj/ Let d = p^ + 1, where p is a prime and A: G N, and let q be a prime congruent 
to 1 mod 4. Assume p ^ q. There exists a {d,^ — y/d — 1)-expander on q + 1 vertices. Furthermore, 
such an expander can be constructed in polynomial time. 


Throughout this paper, we use expanders of various sizes, hence the following theorem appears 
to be useful in our setting, despite providing weaker bound on the Cheeger number. 
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Theorem 2.2. [Theorem 21.19 of Let p > 0 be a real. Then there exists a positive integer d, 
such that for every positive integer n there exists a d-regular multigraph Gn,d on n vertices with 
h{Gn,d) > p. Moreover, graph Gn,d can be constructed in time polynomial in n. 

2.4 Linear arrangements 

A linear arrangement of a graph G = {V,E) is a function (a vertex ordering) vr : L —>■ {1,... ,n}. 
The cost of a linear arrangement vr is defined by YIuv&e We call vr the optimum 

linear arrangement if its cost is minimized over all vertex orderings of G and we denote this cost 
by OLA(G). 

2.5 Satisfiability 

We employ a standard notation related to Sat problems. We use symbols xi, ..., for the variables 
of an instance, and Gi,..., Gm for the clauses. An LCNF formula is ELCNF, if it has exactly I 
literals. We say that an assignment of xi,...,Xn NAE-satisfies an 1-CNF formula if every clause 
contains a satisfied and an unsatisfied literal. An LAND is a conjunction of clauses, where each 
clause is a conjunction of at most I literals. 

2.6 Exponential Time Hypothesis 

The Exponential Time Hypothesis (ETH), introduced by Impagliazzo, Paturi and Zane [301131] is 
now an established tool used for proving conditional lower bounds in the parameterized complexity 
area (see [35] for a survey on ETH-based lower bounds). Intuitively, ETH states that 3-Sat cannot 
be solved in time sub exponential in the number of variables. 

Hypothesis 2.3 (Exponential Time Hypothesis (ETH) [5U]f3T] l. There is no time randomized 
algorithm for 3-Sat. 

Lemma 2.4. (Sparsification Lemma, Jgl]/ ) For every e > 0, there is an algorithm that takes a 
3-CNF formula 4> and returns I 3-CNF formulas (pi,..., (pi, such that: i) I = 0{2^^), ii) for every i, 
(pi has n variables, and every such variable appears in at most clauses of (pi, for some constant 
Ce > 0, Hi) (p is satisfiable if and only if at least one of (pi is satisfiable. The running time of the 
algorithm is 

Consequently, based on ETH we have subexponential hardness of 3- Sat in terms of both the 
number of variables and clauses. 

Theorem 2.5 ([3T]). Unless ETH fails, there is no time algorithm for 3-Sat. 

2.7 Gap problems and PCPs 

In a gap version of a problem, the input instance is promised to belong to one of two languages 
specifying the allowed input, and the goal is to decide which case (language) a given instance belongs 
to. Gap problems are associated with two parameters a, (3. For example, in Gap Max Cutjc,^^], 
we are to distinguish between the case when a given graph admits a cut of size at least (3m and the 
case when a given graph does not admit a cut of size larger than am. Similarly for satisfiability 
problems with gap [a, [3] we are to distinguish between a formula, for which at least (3m clauses 
can be satisfied, and the case where it is impossible to satisfy more than am clauses. 

To introduce gaps in our reductions, we will use the following fundamental result of Dinur. 
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Theorem 2.6. (Almost Linear Size PCP, f77| /j 3 -Sat G PCP^ i(log(n) + C>(loglog(n)), 0(1)) 

3 Combining known reductions for OLA with PCPs and ETH 

In this section we discuss the proof of Theoreni ll.51 i.e., explain how to show that unless ETH fails, 
there is no 2 ''“sHT^T-'-time algorithm for Optimum Linear Arrangement, for some constant 
c G N. We combine several well known concepts from the complexity theory: the ETH hypothesis, 
the PCP theorem, the Sparsification Lemma, gap reductions, and gap amplification. 

First, we show that the combination of the ETH hypothesis, together with the almost linear 
size PCP theorem, and the Sparsification Lemma, gives subexponential hardness of Gap E3-SAT. 
This fact was already observed and used by Marx [37]. 

Theorem 3.1 (see also Lemma 2.2 of m)- Unless ETH fails, there exist c G N, and r G (0,1) 
such that there is no 2 ’-time algorithm for Gap E3-SAT[^ i]. 

Proof. From Theorem 12.61 it follows that there is a PCP^ i(log(ra) + Aloglog(n), H) verifier V for 
3-SAT, for some constants A,B > 0. 

Assume we are given the input E3-CNF formula 4>. Fix any e > 0 and apply Lemma 12.41 to 
(p, obtaining I = 0(2^”) instances (pi, i = l,...,l, each with size bounded by cn, for some constant 
c > 0, depending on e. Let R = log(cn) + Aloglog(cn). 

If we take (pi as the input to the verifier V, then for each random binary string of length at 
most R, it reads at most B bits of the proof (we will assume without loss of generality that 
V always reads exactly B bits). Thus, V has access to N bits of the proof in total, where 
N < B ■ 2^ = B ■ cnlog^{cn). We create variables xi, ...,xn corresponding to each of these bits. 
For a boolean string r of length R, we define a boolean function fr : {0,1}^ {0,1}, with its 

arguments corresponding to relevant variables that can be read from the proof, fr is 

defined as follows, for every S G {0,1}^ it evaluates to true on S if and only if V accepts (pi using S 
as the bits from the proof. Every fr can be written as an equivalent E3-CNF formula Fr (possibly 
adding some constant number of variables). Let C be an upper bound (depending on B) for the 
number of clauses for each such formula. Let (/>( be a formula combined by AND of all Fr. It has 
the number of clauses bounded by C • 2^ and the following holds. 

If (pi is satisfiable, then there is a proof for which V always accepts. By taking the corresponding 
values from this proof and setting them to xi, ...,X]v, we obtain a satisfying assignment of (/>'. 

For the proof in the other direction, if (pi is not satishable, then for any proof for at least half 
of all binary strings r the veriher V rejects (pi. Consider an arbitrary proof and set the variables 
Xi to the corresponding values from this proof. Observe, that for the binary strings r for which V 
rejects (pi at least one clause in Fr at least one clause is false. Consequently, at least ^ • 2^ clauses 
of (p[ are false, i.e., a fraction of at least ^ of the total number of clauses of (p[. 

Finally, let us put c = A + 1, and t = 1 — ^ and suppose there is a 0*{2 iog'^+b™)'^)-time 

algorithm for Gap E3-SATri j_ ,i. Using this algorithm, we can check whether (pi is satisfiable in 

L-*- 20’-^^ 

time 2°^"') and therefore check whether the initial formula (p is satisfiable in time 0*(2'^’^-|-2'^"'-2°("')) = 
Q*( 2 ^ny g -g arbitrary positive number, we obtain a contradiction with the ETH. □ 





Next, we inspect the chain of three textbook NP-hardness reductions |42j . that starts with 
Gap E3-SAT and ends with Gap Max Gut. All of them produce an output instance of linear 
size in terms of the size of the input instance and at the same time preserve gaps. Combining them 
with the previous theorem, we will establish (almost) subexponential hardness of Gap Max Cut. 

Theorem 3.2. 

Gap E3-SAT[„^^] Gap E4-NAE-SAT[„_^] 

Gap E3-NAE-SAT,i+c i+zs, <p” Gap Max Cut.i6+c 

Proof. For simplicity, the proof of Theorem 13.21 is split into the following four separate lemmas. 

Lemma 3.3. 

Gap E3-SAT[„_^] <f Gap E4-NAE-SAT[„_^] 

Proof. Given an E3-CNF formula cf = Gi A ... A Cm, we create a new E4-CNF formula ff = 
CJ A ... A C'm by adding a new variable z to every clause. That is, if Ci = liV hV I3, then we set 
C' = /i V /2 V /s V 2. 

If some assignment satisfies k clauses of (f, then by additionally setting z = 0 the corresponding 
k clauses of (j)' are NAE-satisfied. In the other direction, if some assignment (p NAE-satisfies k 
clauses of cj)', then its negation (p also NAE-satisfies k clauses of cf)'. W.l.o.g. assume that p sets z 
to false, which means that p restricted to the variables of (j) satisfies k clauses of (f. Consequently 
we obtain a gap preserving reduction and the theorem follows. □ 

Lemma 3.4. 

Gap E4-NAE-SATr„ Gap E3-NAE-SAT.^ 1 +^, 

Proof. Given an E4-CNF formula 4> = Ci A ... A Cm, we add m new variables zi, .■■, Zm, and replace 
every clause Ci = liV I2V Z4 with the following two clauses: = /i V ^2 V Zi, C'f = /s V ^4 V ^Zi., 

obtaining an E3-CNF formula cf'. In the following we show that one can NAE-satisfy at least k 
clauses of (f if and only if one can NAE-satisfy at least m + k clauses of (f', which suffices to prove 
the theorem. 

First, observe that if an assignment NAE-satisfies Cj, then by setting Zi appropriately both C[ 
and C'f become NAE-satisfied. On the other hand if an assignment does not NAE-satisfy C*, then 
setting Zi to an arbitrary value NAE-satisfies exactly one clause out of C[ and C'f. Consequently if 
there is an assignment, which NAE-satisfies k clauses of 4>, then it can be extended to an assignment 
of variables of cf', which NAE-satisfies m + k variables of . 

Let us assume that there is an assignment p^ which NAE-satisfies at least m + k clauses of 
cj)'. Note that there is a set I of at least k indices i, such that p NAE-satisfies both C' and C'f. 
Consider a hxed f G / and w.l.o.g. assume that p sets Zi to false. As p NAE-satishes both C'^ and 
C", we infer that p sets at least one of the literals li, I 2 to true, and at least one of the literals I 3 , U 
to false. Consequently p restricted to the variables of cj) NAE-satishes all the clauses Ci for i G I, 
which hnishes the proof of the theorem as |/| > A:. □ 

Lemma 3.5. 


Gap E3-NAE-SATrQ <p" Gap Multigraph Max Cut r3+2a 3+2/3 i 

^ 6 ’ 6 J 
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Proof. Given an E3-CNF formula (j) = Ci A ... A Cm, let Uj be the number of occurrences of Xi in 
4>, both in the positive and negative form. We construct a multigraph G with 2n vertices, and 6m 
edges as follows. For each variable Xi, we create two vertices corresponding to literals Xi, ^Xi, and 
add exactly edges between them. Finally, for every clause Ci we add a triangle connecting the 
vertices corresponding to its literals. We will prove that there is an assignment which NAF-satisfies 
at least k clauses of cf if and only if G admits a cut of size at least 3m + 2 k. 

Let us assume that there is an assignment which NAF-satisfies k clauses of (f. We put vertices 
of G, that correspond to literals evaluated to true, on one side of the cut, and remaining literals to 
the other side of the cut. Fvery edge connecting Xi and -iXj is clearly in the cut, and all of them 
contribute ~ Furthermore, every triangle corresponding to a NAF-satisfied 

clause has exactly two edges cut, all of them contribute 2k to the cut. Thus, we obtain the cut of 
size at least 3m + 2k. 

Let us assume that G has a cut of size at least 3m -|- 2k. First, suppose that Xi and -ix* are on 
the same side of the cut for some variable x,. They contribute at most n* edges to the cut, and we 
can move one of the vertices Xi or -iXj to the other side of the cut without decreasing the number 
of edges in the cut. Thus, we can assume that variables are separated from their negations. The 
edges connecting Xi and -ix, contribute exactly 3m to the cut. If a clause triangle is cut 2 times, 
then it corresponds to a NAF-satisfied clause, otherwise the clause is not NAF-satisfied as all its 
literals have the same value. Thus, we deduce that the considered cut corresponds to an assignment 
which NAF-satisfies k clauses of (f. □ 

Lemma 3.6. 


Gap Multigraph Max Cutu <p" Gap Max Cut r 2+Q 2+/3 i 

L 5^1 i 3 ’ 3 J 

Proof. Given a multigraph G, for every edge e = uv oi G, we create two auxiliary vertices We,Ze, 
add three edges uwe, WeZe, ZeV, and remove the edge uv, obtaining a resulting graph G'. We claim 
that G has a cut of size at least k if and only if G' has a cut of size at least 2m -|- k. 

Let C be a cut of G of size k. If e = uu 0 C, we put We and Zg to the side of the cut opposite to 
u. This way two of the new edges, i.e., uWe and ZgV, are in the cut. If e = uv £ C, then putting w^, 
Ze to the opposite sides of u, v, respectively, gives three edges in the cut. All together we obtain a 
cut of G' of size 2m -|- k. 

Let C he a cut of G' of size 2m -|- k. Clearly, there are at least k edges uv of the original graph 
G, such that all the three edges uWe, WeZe, and ZeV are in the cut C. Note that for such an edge uv 
the vertex is on the side opposite to u, Zg in on the side opposite to and v is on side opposite 
to Ze, which means that v in on the side of the cut opposite to u. Consequently a restriction of C 
to vertices of G gives a cut of size k. □ 

Lemmas 13.3113.4113.51 and 13.61 together prove Theorem 13.21 □ 

Thus, we infer: 

Theorem 3.7. Unless ETH fails, there exist c G N, and 0 < a < /? < 1 such that there is no 
2 _time algorithm for Gap Max Cutjqjj]. 

Finally, we modify the reduction by Garey et al. m from Max Cut to OLA. The construction 
from m introduces a huge clique to the complement of the Max Cut instance; this clique is 
supposed to separate in the ordering the sides of an optimum max-cut solution. The clique has to 
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be large enough so that the cost of any edge going over the clique eclipses the “noise” contribution 
that is given by internal ordering of parts on the left and on the right. For this reason the clique is 
chosen to be of size 0 (n^), which imposes a significant blow-up in the instance size and immediately 
prevents from obtaining a desired subexponential hardness result. 

This is precisely the moment where we exploit that we are working with an instance of Max Cut 
with a gap, as starting with a gap instance, we can accommodate the noise in the gap, and therefore 
we only need the clique to be of linear size. Note that in the following theorem the number of vertices 
produced is linear, however the produced instance might be dense, hence we cannot use the notation 
<p“^ and formulate the reduction properties explicitly. 

Theorem 3.8. There is a polynomial time algorithm, which given an instance I of Gap Max CuT[q,^^] 
with n vertices produces an instance I' o/OLA with 0{n) vertices, such that if I admits a cut with 
at least jdm edges, then I' is a yes-instance, and if I does not have a cut with at least am edges, 
then I' is a no-instance. 

Proof. Given the input graph G, we create a corresponding instance of OLA, G' as follows. We 
take a complement of G, i.e., G and add a disjoint clique G of size Mn, for M = and fully 

connect it to G. 

We will show that G has a cut of size at least /3m if and only if G' has a linear arrangement of 
cost at most _ jjjji. Mn, that will prove the equivalence of the instances. 

Let {A,B) be a cut of size at least /3m. We define a linear arrangement vr as follows. First, we 
list the vertices of A, then we list the vertices of the clique C and then the vertices of B. The order 
of vertices inside A, B and C can be arbitrary. Then by counting the costs of all the edges in the 
clique on (M -|- l)n vertices we obtain; 

|7r(u)-7r(u )|Y -vr(u)| = ^V*) 

uveE(G') uveE{G) ^ ^ 

Now, the costs of each edge going across the clique, i.e., each edge in Eg'{A, B), is at least Mn, 
thus Y1uv&e(g) ■ Ain, and it implies the desired inequality. 

In the other direction, let vr be a linear arrangement of G' of cost at most — jdm- Mn. 

First, we will prove that there exists an optimum linear arrangement vr, such that the vertices of 
G has to be in the same consecutive block of vr. Before that, we will introduce notation regarding 
to linear arrangements, also used in other proofs. 

We assume the vertices of G' are ordered from left to right according to a linear arrangement vr. 
When we speak about the i-th vertex (from the left), we mean the vertex mapped to the number 
i by TT. A set of vertices U is consecutive in tt, if ^{11) = {p,p -\- 1,... ,q — l,q} for p, (7 G N. The 
set of all vertices that are to the left of every vertex from some set U is called vertices to the left 
of U and denoted by L{U). Similarly, we define vertices to the right of U and denote them by 
R{U). A block of U is any inclusion-wise maximal non-empty subset of U that is consecutive in vr. 
The left-most block of U is the block of U whose vertices are mapped to the smallest values by tt. 
Second left-most block of U is the first block of U to the right of the left-most block of U. Inner 
block ofU is the set of all vertices from V{G') \ U located simultaneously to the right the left-most 
block of U and to the left of the second left-most block of U (in the case when U forms a single 
block, the inner block does not exist). 

Claim 3.9. There exists an optimum linear arrangement tt of G' such that the vertices of G are 
consecutive in vr. 
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Proof. Let us choose vr to be an optimum linear arrangement of G' that minimize the number of 
vertices of G that lie between the vertices of G, i.e. /(vr) = \{v (G) : tt{u) < 7r{v) < 7r(w},u, w € 

V{G)}\ is the smallest possible. We claim that the vertices of C are consecutive in vr, i.e. /(vr) = 0. 

Assume that this is not the case. Let X be the inner block of C and we consider the following 
two cases: 

. \Eg{L{X),X)\ < \Eg{X,R{X))\ 

. \Eg{L{X),X)\ > \Eg{X,R{X))\ 

In each of them, we will create another linear arrangement that will contradict the assumptions 
we made. 


G' 


Eg>{L{X),X) Eg'{X,R{X)) 



Figure 1: The situation in the proof of Claim 13^ 

If the first case occurs, let G' be the left-most block of C. We swap places of G' and X. 
Then, we consider how it affects the cost of vr. First, we note that by (*), the optimality of 
vr can be equivalently restated as maximizing the sum of cost of edges of G, i.e. vr is optimal if 
YluveE{G) \'^{u) — Tr{v)\ is the largest possible. We claim that the value of this sum has not decreased. 
Clearly, the cost of edges with both endpoints in V (G) \ X or Ai does not change. Hence, it is 
enough to inspect the edges with exactly one endpoint in X. Every such edge goes from X to L{X) 
or from X to R{X). An edge going from X to L{X) has decreased its length by |G|, and an edge 
going from X to R{X) has increased its length by |G|. Thus, as \Eg{L{X), X)\ < \Eg{X, R{X))\, 
the overall contribution of those edges has not decreased, but the number of vertices between G 
has dropped by |X|, leading to the contradiction. 

If the second case occurs, let G' be the second-left most block of G. Similarly we swap places 
of G' and X. Again, only affected edges are the ones that have exactly one endpoint in X. The 
inequality \Eg{L{X), X)\ > \Eg{X, R{X))\ imples that YluveE{G) ~ ^(^)l increased at 
least by |X|, contradicting the optimality of vr. 

Thus, we have proved that G has to be consecutive in vr. 

□ 

Let vr be the optimum linear arrangement from the claim above. We define a cut {A,B), by 
taking A to be L{C) and B to be R{G). Then by (*) we know: 

I3m-Mn< |7r(rt) — vr(v;)| = 

uveE{G) 
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Thus: 


|7r(M) — 7r(u)| + |7r(u) — 7r(u)| + |7r(n) — 7r(u)| < 

uvGE{G), uv^E{G), uv^E{G), 

u,v£A u,vGB u^A,v^B 

nm + (Mn + n) • B)\ < 2nm + Mn ■ \Eg{A, B)\ 

‘2,'rn 

-6)1 > 13m — > /3m — {/3 — a) ■ m = am 

By the promise given by the gap problem we know that if G admits a cut greater than am, then 
it actually admits a cut of size at least /3m. □ 

The proof of Theorem 11.51 follows immediately from Theorems 13.71 and 13.81 


4 Sparse reduction 

We now introduce a polynomial-time Turing reduction from a gap version of the Min Bisection 
problem on d-regular graphs to Optimum Linear Arrangement. Its key property is that the 
instances of the former problem result in instances of Optimum Linear Arrangement with 
linear number of vertices and bounded degree. Even though the created instance of Optimum 
Linear Arrangement is a multigraph, it does not cause any additional difficulties in further 
reductions described in Section 0 The reduction allows one to distinguish between instances of 
Min Bisection with at most am edges and at least /3m edges in the optimum cut for some fixed 
choice of0<a</3<lby solving the resulting instance of Optimum Linear Arrangement. 
This relates Optimum Linear Arrangement to the following hypothesis: 

Hypothesis II.21 There exist 0 < a < /3 < 1, and an integer d > such that there is no 

2^'^^-time algorithm for Gap Min BiSEGTiON(d)[Q^^]. 

The main result of this section is: 

Theorem II.61 Unless Hypothesis L.HI fails, there exists an integer d £ N such that there is no 
2°^^^-time algorithm solving OLA<((i) in multigraphs. 

We hrst describe a transformation T(-) from an instance G of Min Bisegtion to an instance 
of Optimum Linear Arrangement that forms the key component of our reduction. Then, we 
introduce several technical claims about its properties. Finally, we prove Theorem 11.31 bv showing 
how to decide the instances of Gap Min Bisection based on the cost of the optimum arrangement 
of T{G). The fact that our reduction exhibits only a linear increase in the size of the instance is 
crucial in achieving the bound. 

The result of transformation T(-) is influenced by several parameters. The choice of their values 
is deferred to the proof of Theorem 11.31 Consider an instance G of the Min Bisection problem, 
where G is a dc-regular graph. Assume V{G) = {vi,... ,Vn}. The transformation produces a 
graph G' := T{G) with the vertex set {ui,..., Vn,xi, ..., Xz.\^pn'\}-: where Z £ N and ip £ (0,1) are 
constants chosen later. Note that G' contains the vertices of G. Indeed, we are going to construct 
the edge-set in such a way that G is an induced subgraph of G'. It is actually convenient to 
introduce notation for some of the induced subgraphs of G'. The subgraph with the vertex set 
{xi,... ,xz.\ipn'\} is denoted by H. The graph H is (arbitrarily) divided into Z disjoint induced 
subgraphs Hi of size \(pn~\ each, for some constants Z € N and ip £ (0,1). 

The result of the transformation is illustrated in Figure [2J The edge-set of G' is constructed as 
follows: 
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Figure 2: The resulting instance G = T(G) after applying the reduction. The original graph G is 
an induced subgraph of G with its edges shown in green. The edges of the bipartite graph added 
between V{G) and V{Hi) are shown in blue. 


• The induced subgraph of G on {ui,..., is G. 

• We construct a dj^-regular expander G\h\4h using Theorem 12.21 satisfying h{G\H\,dH) — 
and add its edges on the vertices of H (the value pn will be determined later). 

• For each i E Z} we construct a d^i-^sgular expander G\H^\^dH. using Theorem 12.21 

satisfying h{G\H.\ dH ) — PHi^ und add its edges on the vertices of Hi (the value pui will be 
determined later). 

• For each i E Z} we add a bipartite graph on parts V{G) and V{Hi) such that all 

vertices of V{G) have degree 1 in this bipartite graph and the degrees of vertices from V{Hi) 
differ by at most 1. We denote the maximum degree of the V(Hi) part of this added bipartite 
graph by Ah^g- It is at most 

Note that we are constructing a multigraph, that is when an edge is to be added several times 
in the construction process, we keep all its copies. 

In the following part, we give the proof of correctness of the transformation, as well as determine 
the parameters driving the reduction. In the proof, we first show that the vertices of H have to be 
consecutive in an optimum linear arrangement. Next, we show that actually vertices of each small 
expander Hi are consecutive in such an ordering. This is crucial when we analyze the change of 
the cost of the ordering when moving a vertex of G from one side of H to the other, and in turn 
prove that in an optimum ordering the parts of G to the left and to the right of H are almost of 
the same size. Interestingly, in our reduction we have to use the hypothetical oracle solving the 
decision version of Optimum Linear Arrangement to find the cost of an optimum ordering of 
an auxiliary graph by using binary search. 

The constructed G is influenced (apart from the input graph G) by our choice of parameters 
Z, <p,ph and p//., which in turn influence dn and d//. by Theorem l2.2l The lemmas below impose a 
particular structure on the optimum linear arrangement of G' , provided certain inequalities between 
these parameters are satished. Eventually, the lemmas are employed in the proof of the main 
theorem of this section. 

The following technical Swapping Lemma establishes a condition on degrees in two consecutive 
sets of G under which the swapping of the two sets results in a decreased cost of the ordering. 
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Lemma 4.1 (Swapping Lemma). Consider an ordering tt of any finite graph G'. Assume that the 
sets X,Y C V{G') are consecutive and X immediately precedes Y. Let L := L{X) and R := R{Y). 
Assume 

• the value Px upper bounds the degree of vertices from X in the induced bipartite subgraph 
G’[L,X], 

• Pc is an upper bound on the maximum degree of G'[X,Y], and 

• Py is an upper bound on the degree of a vertex from Y in G' [Y, i?]. 

Finally, let p be a lower bound on the average degree of a vertex from X in G'[X,R]. Then the 
inequality p > Px + 2Pc + Py implies that swapping the vertices of X with the vertices of Y in the 
order specified by tt results in a decrease in the cost of the ordering. 


G' 


Px 


Py 



L := L{X) 


X 


Y 


R := R{Y) 


Figure 3: The vertex set of G' is partitioned into four sets, L,X,Y, and R, in Lemma l4.ll The 
bounds Px, Py, Pc (upper bounds), and p (a lower bound) on the degrees of the induced bipartite 
subgraphs are also shown. 


Proof. The situation is illustrated in Figure [3j The length of all edges connecting a pair of vertices 
from one of the sets L,X,Y,R remains unchanged after swapping X and Y in the ordering. The 
same holds for edges connecting L with R. The length of each edge connecting X and Y increases 
by at most \X\ + |y| < 2max{|X|, |y|}. The cost of each edge connecting X and L increases by 
at most |y|. Similarly, the cost of each edge connecting Y and R increases by at most |y|. On 
the other hand, the edges connecting X and R are shortened, each by |y|. The upper bounds on 
maximum degrees and the lower bound on average degree from the statement of the lemma allow 
us to lower bound the decrease in the cost of the ordering after the swap is performed. For example, 
the decrease in total cost of the edges connecting X with R is at least p|y||y|. The decrease in 
cost after swapping is at least 

p\X\\Y\ - 2mm {\X\,\Y\}Pc ma^{\X\,\Y\} - \X\Px\Y\ - \Y\Py\X\, 


which is equal to 


\X\\Y\{p-2Pc-Px-Py). 


Assuming the inequality from the lemma, this is strictly larger than zero. 


□ 
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We now make several claims about the optimum ordering vr of G' := T{G), where G is a 
de-regular graph. Recall that that Z, if, PH,PHi, dn, and dui are the parameters of the transformation 
T(-) still to be determined. 

Lemma 4.2. IfpH > + 3Z + dg and vr is an optimum linear arrangement of G', then V{H) 

is consecutive in vr. 

Proof. Suppose V{H) is not consecutive in vr. Consider the left-most block of V{H) and denote its 
elements by X. We can assume that \X\ < - otherwise we take the right-most block of V{H) 

and proceed with a mirrored version of the following argument. Denote by Y the inner block of 
V{H) and set L := L{X),R := R{Y). The following choice of values satisfies the assumptions on 
degree upper-bounds of the Lemma ICT 

Px ■= ^H,G, Py ■= da + Z, Pc ■■= Xh,g + Z. 

Since H is an expander, |X| < |id|/2 and H \ X <Z R, we take p = pn- It remains to show the 
inequality from the statement of the Swapping Lemma. We have: 

P = PH > 3Ah,g + SZ + dc = Px + ‘2'Pg + Py- 

Thus, we can swap X and Y and decrease the cost of the ordering. This contradicts the optimality 
of vr. □ 

Lemma 4.3. Leti G {1,..., Z}. IfpHi > dHi+i+^dH+‘2-AH,G, is an optimum linear arrangement 
of G', and for eaeh i' < i the vertiees of Hi/ are conseeutive in vr, then the vertices of Hi are 
eonseeutive in vr. (For the purposes of this lemma, we set dn^+i ■= 0 J 

Proof. Assume Hi not to be consecutive and denote by X the left-most block of V{Hi) in vr. 
Similarly to the situation in the proof of Lemma 14.21 we can assume \X\ < |idj|/2. (Otherwise we 
consider the right-most block instead and proceed with a mirrored version of the argument.) We 
show that moving X to the right decreases the cost of the arrangement. 

Denote by u the vertex positioned by vr immediately to the right of X. Due to Lemma 14.21 we 
know u ^ V{G). Therefore, u G V{Hj) for j i. We distinguish two cases: either j < i or j > i. 

Suppose that u G V{Hj) for j < i. Note that Hj is consecutive in vr. We set Y := V{Hj), L := 
L{X), and R := R{Y). Again, we employ the Swapping Lemma. The following degree upper-bounds 
satisfy its assumptions: 

Px := AH,G + dH, PY:=AH,G + dH, Pc-.= dH- 

Since Hi is an expander, | A| < \Hi\/2 and V(Hi)\X C R, we can set the average degree lower-bound 
p to pHi- By the inequality from the statement of this lemma, we have 

P = PHi > ^dn + 2 Ah,g = Px + 2Pg + Py- 

Thus, the inequality from the Swapping Lemma holds and we can use it to decrease the cost of 
ordering, contradicting the optimality of vr. 

Suppose therefore that u G V{Hj) for j > i. We now use Lemma l4.II again to move the block 
X one position to the right, effectively swapping X and Y := {u}. We set L := L{X),R := R{Y). 
This time, we set: 

Px ■= Ah,g Ydn, Py ■= Ah,g + dn + dui+i , Pg ■= dn- 
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Similarly to the previous cases, we set p := pHi- The inequality from the Swapping Lemma is again 
satisfied: 

P = PHi > + dHi+i = Px + 2Pc + Py- 

Once more, we get a contradiction with the optimality of vr. 

□ 

Dne to Lemma 14.21 we know that an optimum linear arrangement of G' places vertices of H 
consecutively, assuming the inequalities from its statement are satisfied. Furthermore, by iterating 
Lemma [4.31 we get that within H, the vertices of each Hi are grouped together in the arrangement. 
To be precise, in the optimum ordering the subgraphs Hi are placed in the order Hi^, ..., H(^^, 
where (-^i,^ 2 , • • • j ^z) is a permutation of { 1 , 2 ,..., Z}. The vertices of G can thus be only to the 
left of H or to its right. The next lemma shows that H divides the graph G into two roughly equal 
parts. 

Lemma 4.4. Assume G' has been constructed by the transformation T(-) with parameters satisfying 
the inequalities from the statements of Lemmas \4-S\ and \4-3\ as well as Zcp > 2. Moreover, assume 
7 = Sipdc- Consider an optimum linear arrangement tt of G' and set A := L{H),B := R(H). 
Then ||A| — |i?|| < yn. 

Proof. Assume the imbalance ||A| — |i?|| is strictly bigger than yn. Withont loss of generality, 
assume |A| > \B\. We consider the vertex u snch that 7 r(n) = 1 (i.e., the one placed on the left side 
of the arrangement). 

Moving u to the right-most position results in the following changes in the cost of the arrangement. 
The cost associated with the edges of G might be increased by at most dciZlipn] +n) < dG{Zipn + 
Z + n). Note that for snfficiently large n we have > Z and by the assumption of the 

Lemma we have > n, therefore the cost assosiated with the edges of G increases by at most 
dciZipn -|- Z + n) < 2 dGZipn. 

In addition to this, the vertex u is connected to precisely one vertex Vi of each H^.. Before 
moving u, the edge cost of uvi was |A| + (i — l)\ipn~\ + r*, for some 0 < r* < \^pn~\ — 1 and after, it 
becomes |i?| + ((Z — i + l)\ipn~\ — r*), thus the contribution of all those edges has changed by: 


{\B\ -\A\)Z+ '^{{Z-i + l)\^pn] -ri)-'^{{i-l)\ipn] +ri) = {\B\ - \A\)Z + -2ri) < 


i=l 


i=l 


i=l 


-jnZ + Z\ipn]'^ -McZipn + Z\ipn~\ -2dGZipn 

Note that in the last inequality we have used the assumption < dGpn which holds for dG > 
and suffiently large n, as is a constant. 

Combining this cost change with the cost increase of edges of G being at most 2dGZipn, we 
obtain that moving u to the right-most position causes the decrease in the cost of vr, leading to a 
contradiction. □ 

Being equipped with all the required tools, we are ready to prove Theorem 11.61 

Proof of Theorem \1.(A We prove the theorem by introducing a Turing reduction from Gap Min 
Bisection to OLA<((i'), where 

4 


d> 


P — a 


( 1 ) 
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and d! is some constant depending on d, a, and (3. The reduction proceeds in the following way. 

Given an instance G of Gap Min Bisection we create an equivalent instance {G',k) 
of OLA<(d') based on the transformation T(-) applied to G. The value k will depend on, among 
other parameters, the cost of the optimal arrangement of the expander H (recall this is an induced 
subgraph of G'). As mentioned in the beginning, the reduction is a Turing reduction. It is therefore 
equipped with an oracle solving the decision version of OLA, which in turn is enough to find the 
cost of an optimal ordering by using binary search (we will use it to compute OhA{H)). 

We start by establishing the parameters of the transformation in the following order. 

• First, we set 7 := and ip := 3 ^. Note that 7 ,(^ E (0,1) and 7 ,(/? satisfy the condition 
7 = Sipdc from Lemma 14.41 


• Next, we set the value of Z to the following integer: 


2(2« + l) 
■ ' (/3 - a)p, 


)!• 


In particular Z > ^ satisfying the condition from Lemma 14.41 Moreover, the following 
inequality holds: 


2{2a + l) < {13 - a)Zip. (2) 

• By construction Ah,g < 

• Next, we define pn, which in turn determines the value of du by Theorem 12.21 

PH := 3Ah,g + 3Z + dG + I ■ 

The additive term +1 is just to make sure the inequality from Lemma 14.21 is strict and it 
could be replaced by any positive constant. 

• Finally, we set the values pHi ■ 


PHi ■= dHi+i + Mh + 2,Ah,g + 1, 

in the decreasing order i = Z,..., 1, where for simplicity we denote duz+i = 0. Note that the 
value of PHi determines the value of dni via Theorem 12.21 

Observe that with the above definition of all the constants we satisfy all the requirements of 
Lemmas 14.2114.3114.41 

Let G' be the result of the transformation T{G) with the above choice of parameters. The value 
k is then set as follows: 

z 

k := OLA(FI) + am ■ {Z\ipn'\ + n) + m • ^ + (^(^ + 1)^-^ + nz|"v?n]) . (3) 

i=l 

Note that to compute the value of k we need use the oracle solving the decision version of T in a 
binary search routine. It remains to show that G has a bisection of size at most am if and only if 
G' has an optimum linear arrangement of size at most k. 
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Let us assume G has a bisection with at most am edges. We claim that k is an upper bound on 
the cost of an optimum linear arrangement of G'. This is because it accounts for all costs associated 
with an ordering of G' constructed from the optimum bisection of G. Denote by A, B the partition 
of V (G) corresponding to an optimum bisection of G. We create an ordering vr that first lists all 
vertices of A, then the vertices of H in the order of an optimum linear arrangement of H, and 
finally the vertices of B. 

The first term of ([3]) is the cost of all edges inside H. The second term upper bounds the cost 
of edges of G between A and B: there are at most am of them and we upper bound the cost of 
every such edge by |D(G')| = {Z\ipn\ + n). In the third term, we account for the cost of edges 
within A and within B. There are at most m of them and since |T| = \B\ = ^ every such edge has 
cost at most The last term is an upper bound on the cost of edges connecting G and H. Every 
vertex u of G has an edge to exactly one vertex of each If u G ^ we may bound its cost by 
j{v) + i\ipn\ , where j{v) is the length of the part of the ordering from v to the first vertex in H. We 
first count the contribution of the j(u)-terms in the above expression for all choices of u G T. Since 

1^1 = n/2, summing over all u G ^ and i = 1,..., Z we get ~ '^~ 2 —situation is 

analogous for B. The last term of (l3|) is obtained by summing the remaining edge costs i\(pn\ for 
all z = 1,..., Z and v G G. This proves the claimed upper bound. 

In the other direction, we start with assuming that the graph G' has an optimum arrangement 
TT of cost at most k. The aim is to prove that G has a bisection of size at most am. Lemma 14.21 
Lemma 14.31 and Lemma 14.41 together impose a certain structure on vr. Particularly, the vertices of 
H are placed together in vr. We use this fact to construct a bisection of G. Set A := L{H) and 
B := R{H), and if |j4| < \B\, then replace A with B. Note that {A,B) is a partition of V{G) 
and these two sets might have different sizes with the imbalance bounded by Lemma 14.41 We now 
bound the number of edges between A and B. To this end we lower bound the cost of vr in terms 
of \Eq/{A, B)\. Specifically, it is at least: 


z 

OLA{H) + \Eg'{A,B)\ -Zlipn] + (^{^ + 1)^Z + n'^{i - . (4) 

i=l 

There, the first term of 0 accounts for the cost of the edges of H, as it is lower bounded by 
OLA(Lf). The second term of ([!]) is a lower bound on the cost of edges of G going across the 
partition {A,B). There are \Eg'{A, B)\ of such edges and each contributes at least Z\ipn] to the 
cost. Recall that vertices of H must have the following order ,..., , where ,..., 

is some permutation of {1, 2,... , Z}. The third term lower bounds the cost of edges connecting G to 
H. Every vertex u of G has an edge to exactly one vertex from E[£i. Similarly to the analysis above, 
we lower bound the cost of such an edge by j{v) + (i — \)\ipn\. The contribution of j(u)’s from 

the above expression for all choices of u G G is equal to (jZflij + > (2E|=ii)^ > 

(^ + 1)^Z. The remaining part is obtained by summing (i — l)\(pn\ over all u G G and i = 1,..., Z. 
Comparing ([3]) with dH) we obtain: 


\Egi{A,B)\ ■ Z\(.pn\ < am ■ {Z\{pn\ + n) + m ■ —h n i\ipn\ — — 1)1"= 

^ i=i i=i 

2(2a + l)mn hy ^ {(3 — a)m-Zipn 

am ■ Z I ipn \ H-h n 2_^ \ \ < am ■ Z \ (pn \ - \- n ■ Z \ pn \ 


i^n< [i/Jn] 
< 


i=l 
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am ■ Z\ipn\ + 


{j3 — a)m 


Z\(pn] + n • Z\(pn] 




a+ 3(5 


m ■ Z\ipn] 


We infer that \Eg>{A, B)\ < Next, we create a bisection {A!, B') of G as follows. Let C 

be a set of arbitrary vertices of A, then we put A'= A — C,B'= BUC. By moving these 

vertices we get at most ^ ■ do = 'jm additional edges in the cut, thus: 


\Eg'{A',B')\ < \Eg'{A,B)\ + 7 m < + ^-^rn = (3m. 


Therefore, we have \Eg>{A', B')\ < (3m. Since the problem considered is a gap problem, we know 
that G admits either a bisection of size at most am or there is no bisection of size less then (3m. 
Therefore, we conclude the instance admits a bisection of size at most am. 


□ 


5 Lower Bounds for Minimum Fill-in and Other Completion 
Problems 

In this section we prove Theorems 11.11 and 11.31 that is prove conditional lower bounds (under 
ETH and under Hypothesis [L2]) for parameterized completion problems such as Minimum Fill-In, 
Chain Completion, Proper Interval Completion, Interval Completion, Threshold 
Completion, Trivially Perfect Completion. As a starting point we use Theorems 11.51 
and Mi hence our goal is to transform an instance of OLA into an instance of a graph completion 
problem. The main reduction of this section, which transforms an instance of OLA to Chain 
Completion is a slight modification of the reduction of Yannakakis [H], with the only difference 
that on bounded degree instance of OLA we obtain linear number of vertices in the final instance 
of Chain Completion. This fact is crucial to prove Theorem 11.31 while for proof of Theorem ll.il 
it is enough to use the original version of Yannakakis reduction. 

Definition 5.1. A bipartite graph (A,B,F) with vertices A+l B and edges E is a chain graph if 
the set of vertices A (called left side) can he ordered vi,V 2 ,... ,Vn (called left order) in such a way 
that N{vi) C N{v 2 ) C • • • C N{vn). 

In the Chain Completion problem given a bipartite graph (A, B, F) one is asked to add a 
minimum number of edges F' + A x B\F such that (A, B,F D F') is a chain graph. 

Lemma 5.2. There is a polynomial time algorithm, which given an instance I = {G = {V,E),k) 
o/ Optimum Linear Arrangement creates an equivalent instance I' = {G' = {A, B, F), k') of 
Chain Completion, such that the number of vertices of G' is bounded by 0 (Ag ■ |P|); where Ag 
is the maximum degree of G. The reduction works even if G is a multigraph. 

Proof. As the left side of G' we take A = V. For each vertex v G V create a set of Ag new vertices 
Sv = {ve '■ e £ U {vi : deg(u) < i < Ag}. We define B as the union of all the sets S^, thus 

B contains exactly Ag • \ V\ vertices. The set of edges F is constructed as follows. For each w & Sy 
we add to F an edge vw. Additionally, for each Ue G H, where e £ E,e = uv we add to F an edge 
uve, so that the vertex Vg is of degree exactly two in G'. The described transformation is depicted 
in Fig. m To finish the construction of I' we define k' = k + Ag ”'*'^ 2 ~^^ “ 2|F|. 
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(a) OLA instance G (b) Chain Completion instance G' 


Figure 4: Transformation of Optimum Linear Arrangement to Chain Completion 

For a given ordering vr of the vertices of the graph G = {V,E) denote by C{G,Tr) the cost of 
arrangement induced by this ordering (i.e., G{tt,G) = J2uveE K('“) “ ordering a of 

the left side A of the bipartite graph G' = {A,B,F) denote by E{G',a) the number of edges that 
we should add to obtain a minimal chain graph in which the left order coincides with a. We will 
prove the following claim. 

Claim 5.3. For any orderings ofV (or equivalently A) we have E{G' ,tt) = (7(G, tt) + ~ 

2\E\. 

Proof. Let us inspect what is the number of edges in a minimal chain bipartite graph G" which 
has left order vr = {vi,V 2 , ■ ■ ■ ,Vn) and contains G' = {A,B,E) as a subgraph. We know that 
N{vi) C N(vj) for any i < j, which means that each x G must be connected to all vertices from 
the set {uj, Ui+i,..., Vn}- So it means that each vertex from Sv^ is connected to at least (n + 1) — i 
vertices from the set A. Moreover, by minimality of G" the vertex x is connected to exactly this 
number of the vertices if x does not correspond to any edge in G, i.e., when x is of degree exactly 
one in G'. If a vertex Wi G corresponds to some edge e G G with endpoints Vi,Vj (note there 

could be several edges with equal endpoints as we are working with multigraphs) then the vertex 

Wi is connected to vertices Vi,Vj in the graph G'. Hence, in a minimal chain graph G” the degree 
of Wi is either (n + 1) — i or (n + 1) — j = {n + 1) — i + {i — j) depending on whether i < j oi i > j. 
Note that there is a second vertex wj G S^. which also corresponds to the edge e. The degrees of 
Wi and Wj in G” both are equal to (n + 1) — i or (n + 1) — j, depending whether i < j oi i > j. In 
both cases the sum of degrees Wi, wj in G" can be written as ((n + 1) — i) + ((n + 1) — j) + \i — j\- 
Hence, for each edge e with endpoints Vi,Vj we have additional cost of \i — j\. Summing up, we 
infer that the number of edges in G” equals 

^G(^((n + l)-i))+ N - Jl = +C(G,7r). 

i ViVjGE 

The number of added edges equals the number of edges in G" minus the number of edges in G'. 
So we add exactly 

(A^ n(n+I) ^ + 2\E\) = Ag ^^^ 

edges. □ 
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Graph class name 

Forbidden induced subgraphs 

Chordal 

Proper Interval 
Interval 
Threshold 
Trivially perfect 

Cn for n > 4 

claw, 2-net, 3-tent, Cn for re > 4 

bipartite claw, umbrella, re-net for re > 2, re-tent for re > 3, Cn for re > 4 

2K2,C4,P4 

C4,Pa 


Table 1: Characterization of graph classes by forbidden induced subgraphs 


Equivalence of the instances I and I' follows from the claim, and it proves the lemma. 

□ 


Having an instance of Chain Completion we transform it further to an instance of other 
completion problems by simply making A a clique, or by making both A and B cliques. By 
inspecting the forbidden subgraphs definition of each graph class we infer the equivalence of the 
instances, which is enough to prove Theorems 11.11 and ll.31 



(a) claw 


o-o-a 

(b) P4 




(g) n-net, n > 2 


(h) n-tent, n > 3 


Figure 5; Forbidden induced subgraphs for various graph classes 


Classes of chordal, interval, proper interval, threshold, trivially perfect graphs have many 
characterizations. For our purposes the most convenient one is by the set of forbidden induced 
subgraphs [12]. The characterization is presented in Table 1. 

Lemma 5.4. There are polynomial time reductions from Chain Completion problem to Minimum 
Fill-In, Interval Completion, Proper Interval Completion, Threshold Completion, 
Trivially Perfect Completion problems and these reduetions do not change vertex set. 

Proof. For any bipartite graph H' = (C/i, U2,F) consider a graph Ch{H') = (C/i, U2, FU {uv\u, v G 
Ui} U {uv\u,v G 1^2})- III IS] it is shown that any bipartite graph FT is a chain graph if and only 
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if Ch(H') is a chordal graph. It means that in order to reduce an instance of Chain Completion 
H = {A, B, F) to Minimum Fill-In it is enough to construct cliques on sets of vertices A and 
B. So the constructed graph is a union of two cliques and some edges between cliques. Hence any 
arbitrarily completion of this graph does not contain a claw, bipartite claw, umbrella, p-net, g-tent 
for p > 2, g > 3 as these graphs have an independent set of size 3 and a union of two cliques does 
not. It follows that solutions for Minimum Fill-In, Proper Interval Completion,Interval 
Completion problems on such instances coincide and we can look at this reduction to Minimum 
Fill-In as a reduction to Proper Interval Completion or Interval Completion problems. 

It is left to show a reduction from a Chain Completion instance to Threshold Completion 
and Trivially Perfect Completion instances. Having a Chain Completion instance with a 
bipartite graph H = {A,B,F) we consider Trivially Perfect Completion and Threshold 
Completion problems on the graph G = {Au B,F U {{u,v)\u,v E H}). We just add edges such 
that A becomes a clique. We show that a minimum chain completion of the graph H corresponds 
to a completion towards trivially perfect or threshold graph. Let F' be a solution of Chain 
Completion for the graph H. Consider graph G' = {Au B,F U {{u,v)\u,v E H} U F'), G' is a 
union of independent set and clique plus some edges between independent set and clique. So it 
does not contain induced 2 K 2 or G^. If G' contains induced P 4 = U 1 U 2 U 3 U 4 then U 2 ,U 3 belong to 
clique and ui, U 4 to independent set. However, this contradict to fact that edges between clique and 
independent set form a chain completion. As G' does not contain induced 2 K2 ,Pa, it is trivially 
perfect and threshold graph. Let now F' denote solution of Trivially Perfect Completion 
or Threshold Completion on instance G'. To finish the proof of correctness of reduction it is 
enough to show that (A, B,{FD F') n E{A, B)) is a chain graph. If this graph is not a chain graph 
then it must contain two independent edges uiU 2 ,U 3 U 4 [H]. However in such case graph on vertices 
I'l) 'i' 2 ; T 3 , U 4 will induce a P 4 in graph G'UF' which contradict to the fact that G'UF' is a threshold 
or trivially perfect graph. 

To sum up all our reductions from Chain Completion to Minimum Fill-In, Proper Interval 
Completion, Interval Completion, Threshold Completion, Trivially Perfect Completion 
add some edges to a graph of Chain Completion instance and do not change a vertex set of a 
graph. □ 

At this point we almost proved Theorem 11.11 and Theorem 11.31 

Theorem II.IL Unless ETH fails, there is an integer c > 1 such that there are no 
and consequently no algorithms for the following problems: Minimum Fill-In, 

Interval Completion, Proper Interval Completion, Trivially Perfect Completion, 
Threshold Completion, Chain Completion. 


Proof. If the statement is not true then for some of the problems there is an algorithm running 
in time Having instance of OPTIMUM Linear Arrangement on n vertices we can 

reduce it to problem under consideration with (Ac + l)u = 0{n‘^) vertices by Lemmas 15.21 and 15.41 
This gives us an time algorithm which contradicts Theorem 11.51 As 

k <n^ we also have 2 ^ 1 ^^^"*/lower bound on the running time. □ 


Theorem ll.3L Unless Hypothesis H . gl fails, there is no -time algorithm for ChA m Completion, 

and no 2°^"^^-time algorithms for Minimum Fill-In, Interval Completion, Proper Interval 
Completion, Trivially Perfect Completion, and Threshold Completion. Consequently, 
none of these problems can be solved in time 2 °^^^ • 
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Proof. In Section U] we transformed a d-regular Min Bisection instance to Optimum Linear 
Arrangement instance with bounded degree. Pipelined with Lemma [5. 2 1 we get a reduction from 
d-regular Min Bisection to a Chain Completion instance with 0{n) vertices and edges. So 
2°(”'+™')-time algorithm for Chain Completion contradicts Hypothesis 11.21 By Lemma (5^ we can 
reduce Chain Completion to Minimum Fill-In, Interval Completion, Proper Interval 
Completion, Trivially Perfect Completion, and Threshold Completion instance without 
changing the vertex set. Combining all three reductions in one we get reductions from Min 
Bisection to Minimum Fill-In, Proper Interval Completion, Interval Completion, Threshold 
Completion, Trivially Perfect Completion problems which transform an instance with n 
vertices into an instance with 0{n) vertices. This leads to lower bound for all discussed 

completion problems as well as to 2^^^^ • lower bound because k < 11 ?. □ 

6 Hardness of Feedback Arc Set in Tournaments 

In this section we prove Theorem ll.4l that is, the lower bound on the complexity of Feedback Arc 
Set in Tournaments. We start with preparing an appropriately hard instance of Feedback Arc Set 
in general digraphs, so that we can apply the reduction of Ailon et al. [T]. 

6.1 Preparing a hard instance of FAS 

By (E3i^i,E2d^d)-SAT we denote the version of 3-Sat where every variable has 

• exactly 1 positive occurrence in a clause of size 3, 

• exactly 1 negative occurrence in a clause of size 3, 

• exactly d positive occurrences in clauses of size 2, 

• exactly d negative occurrences in clauses of size 2, and 

• there are no clauses of size 1. 

Similarly as before, Gap (E3ip,E2rf rf)-SATj^ for 0 < a < /3 < 1 is the problem of distinguishing 
whether the maximum number of clauses that can be satisfied in a given instance of (E3i^i,E2rf^rf)-SAT 
is at most am, or at least /3m, where m is the total number of clauses. We now give a hardness 
result for Gap (E3i,i,E2rf^rf)-SATj^^^j. 

Lemma 6.1. There exists a positive integer d such that 

Gap E3-NAE-SATr„ 11 Gap (E3i i,E2rfrf)-SATri+.+3d 

^ ^ ’ ’ I 2+3d PI 

Proof. Let (f be the input E3-CNF formula. For every variable x, let n{x) be the number of 
occurrences of x in (f. Let Gx be a d-regular expander with n(x) vertices, obtained using Theorem l2.2l 
for p = 2; suppose vertices of Gx are numbered 1 , 2 ,..., n{x). Create i/i as follows: 

• Replace x with n{x) new variables xi,X 2 ,..., Xn(x)^ corresponding to the occurrences of x in 

• For every ij G E{Gx) with i A 2, introduce two clauses (-iXj V Xj) and (xj V ^Xj), which (if 
unbroken) force the evaluation of Xi to be equal to that of Xj ; 
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• For every ii G E{Gx), introduce a trivial clause {^Xi V Xj); 

• Perform the same construction for every other variable of <p] 

• For every 3-clause C of (/>, introduce two new clauses C and C" to V'- Clause C" is constructed 
from C by replacing each occurrence of some variable x with the new variable x* corresponding 
to this occurrence; the polarity of the literals in C is the same as in C. Clause C is 
constructed from C by reversing the polarity of each literal. For example, if C* = (x V “■y V z), 
then C' = (xi V -ly^ V Zk) and C = (-ix* V yj V ^Zk), where i,j, A: are the indices of the 
occurrences of x, y, z in C, respectively. 

It is easy to see that every variable of the new formula V' appears in exactly two 3-clauses, once 
positively and once negatively. Also, it has exactly 2d occurrences in 2-clauses: d positive and d 
negative. Since the original formula had only clauses of size 3, the new formula has exactly 3m 
variables and m{2 + 3d) clauses. 

If there is a variable evaluation A for (p that NAE-satisfies all clauses of (f>, then we can construct 
a variable evaluation A' for ip by assigning all the variables Xi that originate in variables x the value 
A(x). Then it is easy to see that A' satisfies all the clauses of ^p. 

Suppose now that every variable evaluation for (p NAE-satisfies at most am clauses, and for 
the sake of contradiction suppose that there is a variable evaluation A' for ip that satisfies more 
than (1 -|- a -|- 3d)m clauses. Let x be a variable of (p, and let us modify A' as follows: assign all the 
variables xi,X 2 ,... ,Xn(x) the value that is taken by the majority of these variables in the original 
evaluation A' (breaking ties arbitrarily). Observe that since h{Gx) > 2, this step cannot decrease 
the number of satisfied clauses: if q is the number of variables out of xi,X 2 ,... ■,Xn(x) that take 
the minority value, then by replacing their values by the majority value we can unsatisfy at most 
2q 3-clauses, but we satisfy at least h{Gx) • q > 2q 2-clauses that were previously unsatisfied. By 
performing this operation for every variable of (p, we can assume without loss of generality that in 
X' all the variables originating in the same variable of cp are assigned the same value. This naturally 
defines a variable evaluation A for (p. Then, provided A' satisfied more than (1 -|- a -|- 3d)m clauses 
of V’) we infer that A NAE-satisfies more than am clauses of (p. This is a contradiction. □ 

We now turn our attention to the Feedback Vertex Set in general directed multigraphs: 
given a directed multigraph G, find the smallest possible subset of vertices X such that G — V is 
acyclic. By FVS-BAL((i) we denote the variant of FVS where the input directed multigraph has 
no loops, and is 2(i-regular and balanced, i.e., the indegree and the outdegree of every vertex is 
equal to d. Again, Gap FVS-BAL((i)j^ is the gap problem where we need to distinguish between 
the cases when the optimum size of X is at most an and at least jSn, where n is the number of 
vertices of the multigraph. 

Lemma 6.2. 


Gap (E3i,i,E2rf,rf)-SAT[^^,j Gap FVS-BAL(d + 2)[,/2 

Proof. Let (p be the input instance of (E3iq,E2£;,^)-SAT, and let n be the number of variables of 
(p. Then m, the number of clauses of (p, is equal to (2/3 -|- d)n. 

Construct a directed multigraph G as follows. For every variable x of (p, create two vertices 
Uj and u*, corresponding to setting x to true and false, respectively. Add edges (uy,tt*) and 
(tt* ,Uy) to the edge set, for every variable x of (p. Moreover, for every 2-clause of (p add a 2-cycle 
between vertices corresponding to its literals (e.g. clause x V -■?/ gives rise to edges {u^,u\) and 
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and similarly for every 3-clause of (j) add a 3-cycle between vertices corresponding to its 
literals, oriented arbitrarily. Note that in this manner trivial clauses of the form (-ix V x) give rise 
to additional copies of the 2-cycle (uy, u* )(tty, u^). This concludes the construction of G. It is 
easy to verify using the assumed properties of (j) that every vertex of G has indegree and outdegree 
equal to d -|- 2, and moreover in the construction we did not introduce loops. Let n' = 2n be the 
number of vertices in G. 

Suppose first that there exists a variable evaluation A for cj) that satisfies all the clauses of (j). 
Define X to be the set of all the vertices for x being a variable of (j)] note that |X| = n = 

n'/2. Since A satisfies all the clauses, and every variable of (p participates in exactly one 3-clause 
positively and in exactly one 3-clause negatively, then it is easy to see that all the weakly connected 
components of G — X are either isolated vertices or single edges. Thus, G — X is acyclic. 

Assume now that every variable evaluation for cp satisfies at most am clauses, and for the sake 
of contradiction suppose that there exists a set X with |X| < such that G — X is 

acyclic. Observe that from each pair {uy,Uy} at least one vertex has to belong to X. Define a 
variable evaluation A for cp as follows: if |{My,My} n X| = 1, then A(x) is such that £ X, 

and otherwise A(x) is chosen arbitrarily. Observe that the first alternative holds for a set of more 
than n — variables; let us denote them by S. Since G — X is acyclic, each of 2- and 3-cycles 

constructed for a clause C of (p has at least one vertex from X. If all the variables of G belong to 
S', then it can be easily seen that this implies that A satisfies G. Hence, the only clauses of (p that 
can be unsatisfied by A are the ones that contain at least one variable outside S. Every variable of 
(p occurs in at most 2d -|- 2 clauses, and there are less than • n variables outside S, which means 
that A unsatisfies less than clauses. Hence the fraction of unsatisfied clauses is less 

{l-a){2d + 2)n _{l-a){2d + 2) 

3m (2 -|- 3d) “ 


This is a contradiction. 


□ 


Finally, there is a well-known reduction that reduces Feedback Vertex Set to Feedback Arc Set 
in the directed setting. This reduction appears to preserve the gap. In the following, by Gap FAS(d)j^ 
we denote the problem of determining, for a given directed multigraph G without loops whose 
underlying undirected multigraph is d-regular, whether the minimum number of edges that needs 
to be removed from G to make it acyclic is at most am or at least fdm, where 0 < a < /3 < 1 and 
m is the number of edges in G. 

Lemma 6.3. 

Gap FVS-BAL(d)r Gap FAS(d + l). . ,5 , 

^ D+i’d+il 

Proof. Let G be the input directed multigraph, and let n and m denote the numbers of edges and 
vertices of G, respectively; by the assumption that G is 2 d-regular we know that m = dn. Construct 
a graph G' as follows: 

• For every u G V{G) create two vertices u~ G and an edge {u~,u^) G E{G'); 

• For every edge (n, u) G F'(G), create an edge (tt+,u“). 

This concludes the construction of G'. Let Ei, E 2 be the sets of edges constructed in the first and 
second bullet point, respectively. Since G was 2 (i-regular and balanced, we infer that every vertex 
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U+ has outdegree d and indegree 1, whereas every vertex u~ has outdegree 1 and indegree d. Thus, 
G' has n' = 2n vertices and m' = m + n = {d+ l)n edges. 

Suppose first that X is a subset of vertices of G with size at most an such that G — X is 
acyclic. Let F = {(n+,n“) \,u £ X} C Ei. Then it can be easily seen that G' — F is acyclic, and 
\F\ = |X| < an = -^^m'. 

Assume now that every subset X FV{G) for which G — X is acyclic has size at least fdn, and 
for the sake of contradiction suppose that there is a set F C E{G') such that G' — T is acyclic 
and |F| < . Observe that if F contains some edge G E 2 , then we could modify F 

by removing {u^,v~) from F and adding to F (unless it is not already contained in F, in 

which case we do not add any edge to F). This operation can only decrease the number of edges 
in F and preserves the property that G' — T is acyclic; this is because after removing v~ 

becomes a sink. Thus, without loss of generality we can assume that F Q Ei. Let X be the set of 
vertices u G V{G) for which G F. Since G' — T is acyclic, it easily follows that G — X is 

also acyclic. Moreover, |X| = \F\ < -^j^rn’ = /3n. This is a contradiction. □ 

Finally, observe that in an instance of FAS without loops one can subdivide every edge once, 
which doubles the number of edges while not changing the size of the optimum solution. Thus, 
application of this reduction to the gap problem shrinks the gap twice and makes the directed 
graph at hand simple: it has no loops, no parallel edges, and moreover if [u, v) is an edge then 
(v, u) is not. By combining this observation with Theorem 13.II and Lemmas 13.3113.4116.1116.21 and l6.31 
we obtain the following result. 

Theorem 6.4. Unless ETH fails, there exist 0<a</3<l, c>l, and d > 0 such that there is 
no 2 algorithm for Gap FASjq,^^] on directed simple graphs of maximum total degree d. 

6.2 Reducing FAS to FAST 

Theorem II.41 Unless ETH fails, there is an integer c > 1 such that there is no 
consequently no algorithm for Feedback Arc Set in Tournaments. 

Proof. We provide a randomized reduction that essentially reiterates the argument of Ailon et 
al. [T]. In the analysis, we use the known fact that for any directed graph H, fas(R) is equal to 
the minimum possible number of edges oriented backwards (called feedback edges) in an ordering 
of vertices of V{H), where fas(R) is the optimum size of a feedback arc set in H. Thus, we may 
equivalently think of the FAS problem as finding an ordering tt of F (H) that minimizes the number 
of feedback edges. For an ordering vr of F (H), by fas(R, vr) we denote the number of feedback edges 
in the ordering vr. 

Let a, (5, c, d be the constants given by Theorem 16.41 and let G be an instance of Gap FAS[„ ^ 3 ], 
where G is a simple directed graph with n vertices and m edges and has maximum total degree d. 
Note that w.l.o.g. we may assume ni > n, as otherwise there is a vertex in G with no outgoing 
edges, which can be safely removed. Let us fix an integer k, to be determined later. We consider the 
k-blow up Gk defined as follows: for every tt G F(G) we create a k vertices ui,U 2 , ■ ■ ■ ,Uk in Gfc, and 
for all 1 < i,j < A: we put {ui,Vj) G E{Gk) if and only if (tt, n) G E{G). Thus, vertices tt* are twins. 
Ailon et al. [T], based on a communication by Alon, argue that there is an optimum ordering for G^ 
which may be obtained by taking an optimum ordering for G and replacing every vertex tt G F(G) 
by a block consisting of vertices {tti}i<j<fc in any order; hence in particular fas(Gfc) = k'^ ■ fas(G). 
For an ordering a of F(G), let be an ordering of F(Gfc) constructed in the manner descibed 
above. 
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Construct a tournament Tk from Gk by adding edges between every pair of vertices that are 
not connected by an edge in Gk, where the orientations of these edges are chosen independently 
and uniformly at random. Observe that \E{Gk)\ = • |i?(G)| < dk‘^n/2. Let Rk = {V{Tk), E{Tk)\ 

E{Gk)) be the directed graph consisting only of the edges picked at random. Then for a sufficiently 

large n we have that \E{Rk)\ = (" 2 ^) — \E{Gk)\ > ^^ 4 ^ , and of course \E{Rk)\ < ■ 

We now prove that with high probability, Feedback Arc SET(T/i,) is closely related to 
Feedback Arc SET(Gfc), because the number of feedback edges that need to be chosen from the 
edges picked at random is concentrated around the expected value. 

Let us fix some ordering TT of y(Gfc), then fas(rfc, vr) = fas(Gfc, 7 r)+fas(i?fc, ^r). Fore G E{Rk), let 
Xe be the indicator random variable having value 1 if e is a feedback edge w.r.t. vr, and 0 otherwise. 
Let also X = X^eeE(Ri,) then MX = Let r] = Since X^-s are independent, from 

the Chernoff bound we obtain that 


Pr(|A — EA| > rjk'^n) < 2 exp 


2rf‘k'^'n?‘\ 

\E{Rk)\) 


< 2 exp(— 


Suppose now that there exists an ordering vr of V{G) that has at most am feedback edges. 
Then, with probability at least 1 — 2exp(—we have that 

fas(rfc,7rfc) < ak^m + \E{Rk)\/2 + Tqk^n < ^ ^ + \E{Rk)\/2. (5) 

o 

Hence, if we can set A: to be a large enough constant, such that conclusion ([5|) holds with probability 
at least 3/d. 

Suppose now that fas(G) > /3m. Then, for a hxed ordering a of V{Gk) we have that with 
probability at least 1 — 2exp(—dr/^A:^) it holds that 

ias{Tk,cr) > ias{Gk,(T) + \E{Rk)\/2 — rjk'^n > k‘^ias{G) + \E{Rk)\/2 — r]k‘^n 

> k"^f3m + \E{Rk)\/2 —'qk'^n >— ^ ■ k‘^m + \E{Rk)\/2. ( 6 ) 

O 

We would like to infer that with high probability this conclusion holds for all the possible orderings a, 
and for this we will use the union bound. Observe that the number of orderings a oiV(Gk) is {nk)\ = 
exp{0{nk log{nk))), while the probability of failure for each of them is at most 2 exp(—dr/^A:^). Since 
r/ is a positive constant, simple computations show that by setting k = 0(nlogn), we have that 
{nk)\ ■ 2exp(—d? 7 ^A:^) < 1/d, and hence conclusion ([^ holds simultaneously for all orderings a with 
probability at least 3/d. 

Suppose now that Feedback Arc Set in Tournaments admitted an algorithm with running 

time 2 ^ for c' = c + ^, where c is as in Theorem 16.dl Apply this algorithm to the constructed 

tournament Tk to compute fas(rfc). In case fas(G) < am, then with probability at least 3/d we 
have that ias{Tk) < • k'^m + \E{Rk)\/2. In case fas(G) > /3m, then with probability at least 

3/d we have that fas(rfc) > ■ k‘^m + \E{Rk)\/2. Since , these two alternatives are 

disjoint and the algorithm can, with double-sided error, resolve the input instance of Gap FASjq,^^]. 

Since |F(rfc)| = 0(n^logn), this procedure runs in time 2 Logc/n/L This is a contradiction with 
Theorem 16. dl □ 
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7 Conclusions 


In this work we have given evidence that can be the final answer for 

the running times of parameterized algorithms for Minimum Fill-In, Interval Completion, 
Proper Interval Completion, Trivially Perfect Completion, Threshold Completion, 
and Chain Completion. This evidence is based on a new complexity hypothesis connected to the 
hardness of approximation for the Min Bisection problem. Thus, the answer given by us is not 
completely satisfactory: the lower bounds that we can give only under the assumption of ETH are 
much weaker. Rather, our results uncover a surprising link between the parameterized algorithms 
for Minimum Fill-In and related problems, and the approximability of Min Bisection. Thus it 
seems that the question about the optimality of the former has a much deeper, fundamental nature. 

Therefore, we believe that our work strongly motivates further investigation of Hypothesis 11.21 
Can this conjecture be linked to ETH and possibly some other strong conjectures like SETH, the 
existence of linear PCPs, or the conjectures proposed by Feige [22]? Or maybe it can be simply 
disproved? 

Our improved lower bound for Feedback Arc Set in Tournaments still has a gap between 
and A closer inspection of the proof uncovers a fundamental obstacle for why we cannot 
achieve tightness: the Chernoff concentration bound used in the proof of Theorem ll.4l is essentially 
tight, because in every tournament on n vertices there is a feedback arc set of size ( 2 )/2— |29j . 
If this error term was of magnitude 0(n) instead of 0(n^/^), then our approach would give a tight 
result for FAST. Can this problem be circumvented, or maybe the high anticoncentration of the 
number of feedback edges in a random ordering of a tournament can be exploited algorithmically 
to obtain a faster algorithm? 

Finally, even assuming Hypothesis 11.21 we do not get tight bounds, due to the (poly)logarithmic 
factors in the exponent describing the running time of the existing algorithms for completion 
problems. Bridging this gap can be seen as another forthcoming goal. 
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A Problem definitions 


Optimum Linear Arrangement (OLA) 

Input: A graph G = {V, E), an integer k. 

Question: Does there exist a linear arrangement vr of G of cost at most kl 


Optimum Linear ARRANGEMENT<(d) (OLA<(d)) 

Input: A graph G = {V, E) with degree at most d, an integer k. 

Question: Does there exist a linear arrangement tt of G of cost at most /c? 
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Max Cut 

Input: A graph G = (V, E), an integer k 
Question: Does there exist a cut of size at least kl 


Gap Max CuT[q,^^] 

Input: A graph G = (D, E). 

Case 1: G admits a cut of size at least /3m. 

Case 2: G does not admit a cut of size larger than am. 


Min Bisection 

Input: A graph G = (V, E) with even number of vertices, an integer k. 
Question: Does there exist a cut {A,B) of size at most k, such that |A| = \B\? 


Gap Min Bisection 

Input: A d-regular graph G = {V, E) with even number of vertices. 

Case 1: G admits a cut iA,B) of size at least /3m, such that |A| = \B\. 

Case 2: G does not admit a cut {A,B) of size larger than am, such that |A| = \B 


A.l Satisfiability problems 

We consider several variants of the satisfiability problem, in general defined as follows. 


X-Sat 

Input: An X-Sat formula (/> = Ci A ... A Gm - 

Question: Does there exist an assignment of the variables of (f), such that (/> is satisfiable? 


X-Sat[^^^] 

Input: An X-Sat formula (/> = Ci A ..., AGm - 

Case 1: cj) admits an assignment satisfying at least /3m clauses. 

Case 2: cj) does not admit an assignment assignment satisfying more than am clauses. 


Where an X-Sat formula is a formula from the Sat related problem, precisely it is an E/-CNF 
formula for EESat, EZ-NAE-Sat, and an Z-AND formula for Z-And-Sat. 

We also similarly define a problem X-Sat (cZ) with the difference that the variables of an input 
formula occur in at most d clauses, e.g. Z-And-Sat (d). 

A.2 Completion problems 

The following problems is a generic version of a completion problem to a given graph class X. 
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X-COMPLETION 

Input: An undirected graph G, an integer k. 

Question: Is it possible to add at most k edges to G, so that the obtained graph belongs to 
the graph class X? 


We consider the following list of completion problems: Minimum Fill-In, Chain Completion, 
Proper Interval Completion, Interval Completion, Threshold Completion, Trivially 
Perfect Completion, where Minimum Fill-In is completion to chordal graphs, and other 
problems have self-descriptive names. 
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